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PCT09 



RAW SEQUENCE LISTING DATE: 10/24/2002 

PATENT APPLICATION: US/09/744 , 3 13A TIME: 16:11:46 

Input Set : A:\pf0706usn_seqlist.txt 
Output Set: N:\CRF4\10242002\I744313A.raw 



1 <110> APPLICANT: YUE, Henry 

2 TANG , Y. Tom 

3 AZIMZAI, Yalda 

5 <120> TITLE OF INVENTION: SORTING NEXINS 
7 <130> FILE REFERENCE: PF-0706 USN 
9 <140> CURRENT APPLICATION NUMBER: 09/744, 313A 
C--> 10 <141> CURRENT FILING DATE: 2002-09-20 

12 <150> PRIOR APPLICATION NUMBER: PCT/US00/14 831 

13 <151> PRIOR FILING DATE: 2000-05-26 

15 <150> PRIOR APPLICATION NUMBER: 60/136,740 

16 <151> PRIOR FILING DATE: 1999-05-27 

18 <150> PRIOR APPLICATION NUMBER: 60/139,566 

19 <151> PRIOR FILING DATE: 1999-06-16 

21 <160> NUMBER OF SEQ ID NOS : 4 

22 <170> SOFTWARE: PERL Program 

24 <210> SEQ ID NO: 1 

25 <211> LENGTH: 465 

26 <212> TYPE: PRT 

27 <213> ORGANISM: Homo sapiens 
29 <220> FEATURE: 

30 



31 


<223> OTHER 


INFORMATION: 


Incyte 


ID No: 2124842CD1 






33 


<400> SEQUENCE: 


1 
















Thr 


Gin 


Lys 


34 


Met 
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Leu 


He 


His 
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He 
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Arg 
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35 
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36 


Arg 


Gly 


Glu 


Ser 


Phe 


Gly 


He 


Ser 


Arg 


lie 


Gly 


Ser 


Lys 


He 


Lys 


37 






20 










25 










30 


38 


Gly Val 


Phe 


Lys 


Ser 


Thr 


Thr 


Met 


Glu 


Gly 


Ala 


Met 


Leu 


Pro 


Asn 


39 










35 










40 










45 


40 


Tyr 


Gly 


Val 


Ala 


Glu 


Gly 


Glu 


Asp 


Asp 


Phe 


He 


Glu 


Glu 


Gly 


He 


41 






50 










55 










60 


42 


Val 


Val 


Met 


Glu 


Asp 


Asp 


Ser 


Pro 


Val 


Glu 


Ala 


Val 


Ser 


Thr 


Pro 


43 










65 










70 










75 


44 


Asn 


Thr 


Pro 


Arg 


Asn 


Leu 


Ala 


Ala 


Trp 


Lys 


He 


Ser 


He 


Pro 


Tyr 


45 








80 










85 










90 


46 


Val 


Asp 


Phe 


Phe 


Glu 


Asp 


Pro 


Ser 


Ser 


Glu 


Arg 


Lys 


Glu 


Lys 


Lys 


47 
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100 
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48 


Glu 


Arg 


He 


Pro 


Val 


Phe 


Cys 


He 


Asp 


Val 


Glu 


Arg 


Asn 


Asp 


Arg 


49 








110 










115 
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50 


Arg 


Ala 


Val 


Gly 


His 


Glu 


Pro 


Glu 


His 


Trp 


Ser 


Val 


Tyr 


Arg 


Arg 


51 






125 










130 
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52 


Tyr 


Leu 


Glu 


Phe 


Tyr 


Val 


Leu 


Glu 


Ser 
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Leu 


Thr 


Glu 


Phe 


His 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/744 , 313A 



DATE: 10/24/2002 
TIME: 16:11:46 



Input Set : A:\pf0706usn_seqlist.txt 
Output Set: N:\CRF4\10242002\I744313A.raw 



54 Gly Ala Phe Pro Asp Ala Gin Leu Pro Ser Lys Arg He He Gly 

55 155 160 165 

56 Pro Lys Asn Tyr Glu Phe Leu Lys Ser Lys Arg Glu Glu Phe Gin 

57 170 175 180 

58 Glu Tyr Leu Gin Lys Leu Leu Gin His Pro Glu Leu Ser Asn Ser 

59 185 190 195 

60 Gin Leu Leu Ala Asp Phe Leu Ser Pro Asn Gly Gly Glu Thr Gin 

61 200 205 210 

62 Phe Leu Asp Lys He Leu Pro Asp Val Asn Leu Gly Lys He He 

63 215 220 225 

64 Lys Ser Val Pro Gly Lys Leu Met Lys Glu Lys Gly Gin His Leu 

65 230 235 240 

66 Glu Pro Phe He Met Asn Phe He Asn Ser Cys Glu Ser Pro Lys 

67 245 250 255 

68 Pro Lys Pro Ser Arg Pro Glu Leu Thr He Leu Ser Pro Thr Ser 

69 260 265 270 

70 Glu Asn Asn Lys Lys Leu Phe Asn Asp Leu Phe Lys Asn Asn Ala 

71 275 280 285 

72 Asn Arg Ala Glu Asn Thr Glu Arg Lys Gin Asn Gin Asn Tyr Phe 

73 290 295 300 

74 Met Glu Val Met Thr Val Glu Gly Val Tyr Asp Tyr Leu Met Tyr 
75- 305 310 315 

76 Val Gly Arg Val Val Phe Gin Val Pro Asp Trp Leu His His Leu 

77 320 325 330 

78 Leu Met Gly Thr Arg He Leu Phe Lys Asn Thr Leu Glu Met Tyr 

79 335 340 345 

80 Thr Asp Tyr Tyr Leu Gin Cys Lys Leu Glu Gin Leu Phe Gin Glu 

81 350 355 360 

82 His Arg Leu Val Ser Leu He Thr Leu Leu Arg Asp Ala He Phe 

83 365 370 375 

84 Cys Glu Asn Thr Glu Pro Arg Ser Leu Gin Asp Lys Gin Lys Gly 

85 380 385 390 

86 Ala Lys Gin Thr Phe Glu Glu Met Met Asn Tyr He Pro Asp Leu 

87 395 400 405 

88 Leu Val Lys Cys He Gly Glu Glu Thr Lys Tyr Glu Ser He Arg 

89 410 415 420 

90 Leu Leu Phe Asp Gly Leu Gin Gin Pro Val Leu Asn Lys Gin Leu 

91 425 430 435 

92 Thr Tyr Val Leu Leu Asp He Val He Gin Glu Leu Phe Pro Glu 

93 440 445 450 

94 Leu Asn Lys Val Gin Lys Glu Val Thr Ser Val Thr Ser Trp Met 

95 455 460 465 

98 <210> SEQ ID NO: 2 

99 <211> LENGTH: 450 

100 <212> TYPE: PRT 

101 <213> ORGANISM: Homo sapiens 

103 <220> FEATURE: 

104 <221> NAME/KEY: misc_f eature 

105 <223> OTHER INFORMATION: Incyte ID No: 5215690CD1 
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107 <400> SEQUENCE: 2 

108 Met Glu Gin Ala Pro Pro Asp Pro Glu Arg Gin Leu Gin Pro Ala 

109 15 10 15 

110 Pro Leu Glu Pro Leu Gly Ser Pro Asp Ala Gly Leu Gly Ala Ala 

111 20 25 30 

112 Val Gly Lys Glu Ala Glu Gly Ala Gly Glu Glu Ser Ser Gly Val 

113 35 40 45 

114 Asp Thr Met Thr His Asn Asn Phe Trp Leu Lys Lys lie Glu lie 

115 50 55 60 

116 Ser Val Ser Glu Ala Glu Lys Arg Thr Gly Arg Asn Ala Met Asn 

117 65 70 75 

118 Met Gin Glu Thr Tyr Thr Ala Tyr Leu lie Glu Thr Arg Ser Val 

119 80 85 90 

120 Glu His Thr Asp Gly Gin Ser Val Leu Thr Asp Ser Leu Trp Arg 

121 95 100 105 

122 Arg Tyr Ser Glu Phe Glu Leu Leu Arg Ser Tyr Leu Leu Val Tyr 

123 110 115 120 

124 Tyr Pro His lie Val Val Pro Pro Leu Pro Glu Lys Arg Ala Glu 

125 125 130 135 

126 Phe Val Trp His Lys Leu Ser Ala Asp Asn Met Asp Pro Asp Phe 

127 140 145 150 

128 Val Glu Arg Arg Arg lie Gly Leu Glu Asn Phe Leu Leu Arg lie 

129 155 160 165 

130 Ala Ser His Pro lie Leu Cys Arg Asp Lys lie Phe Tyr Leu Phe 

131 170 175 180 

132 Leu Thr Gin Glu Gly Asn Trp Lys Glu Thr Val Asn Glu Thr Gly 

133 185 190 195 

134 Phe Gin Leu Lys Ala Asp Ser Arg Leu Lys Ala Leu Asn Ala Thr 

135 200 205 210 

136 Phe Arg Val Lys Asn Pro Asp Lys Arg Phe Thr Asp Leu Lys His 

137 215 220 225 

138 Tyr Ser Asp Glu Leu Gin Ser Val lie Ser His Leu Leu Arg Val 

139 230 235 240 

140 Arg Ala Arg Val Ala Asp Arg Leu Tyr Gly Val Tyr Lys Val His 

141 245 250 255 

142 Gly Asn Tyr Gly Arg Val Phe Ser Glu Trp Ser Ala lie Glu Lys 

143 260 265 270 

144 Glu Met Gly Asp Gly Leu Gin Ser Ala Gly His His Met Asp Val 

145 275 280 285 

146 Tyr Ala Ser Ser lie Asp Asp lie Leu Glu Asp Glu Glu His Tyr 

147 290 295 300 
14 8 Ala Asp Gin Leu Lys Glu Tyr Leu Phe Tyr Ala Glu Ala Leu Arg 

149 305 310 315 

150 Ala Val Cys Arg Lys His Glu Leu Met Gin Tyr Asp Leu Glu Met 

151 320 325 330 

152 Ala Ala Gin Asp Leu Ala Ser Lys Lys Gin Gin Cys Glu Glu Leu 

153 335 340 345 

154 Val Thr Gly Thr Val Arg Thr Phe Ser Leu Lys Gly Met Thr Thr 

155 350 355 360 
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156 


Lys 
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Phe 


Gly 


Gin 


Glu 


Thr 


Pro 


Glu 


Gin 


Arg 


Glu 


Ala 


Arg 


He 
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370 










375 


158 


Lys 


Val 


Leu 


Glu 


Glu 


Gin 


He 


Asn 


Glu 


Gly 


Glu 


Gin 


Gin 


Leu 


Lys 


159 










380 










385 










390 


160 


Ser 


Lys 


Asn 


Leu 


Glu 


Gly 


Arg 


Glu 


Phe 


Val 


Lys 


Asn 


Ala 


Trp 


Ala 


161 










395 










400 










405 
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Asp 


He 


Glu 


Arg 


Phe 


Lys 


Glu 


Gin 


Lys 


Asn 


Arg 


Asp 


Leu 


Lys 


Glu 


163 










410 










415 










420 


164 


Ala 


Leu 


He 


Ser 


Tyr 


Ala 


Val 


Met 


Gin 


He 


Ser 


Met 


Cys 


Lys 


Lys 


165 










425 










430 










435 


166 


Gly 


He 


Gin 


Val 


Trp 


Thr 


Asn 


Ala 


Lys 


Glu 


Cys 


Phe 


Ser 


Lys 


Met 


167 
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170 <210> SEQ ID NO: 3 

171 <211> LENGTH: 1992 

172 <212> TYPE: DNA 

173 <213> ORGANISM: Homo sapiens 

175 <220> FEATURE: 

176 <221> NAME/KEY: misc_f eature 

177 <223> OTHER INFORMATION: Incyte ID No: 2124842CB1 

179 <400> SEQUENCE: 3 

180 gtatgaaaac tccaaaagtt atgttagcta tacctttagt ttatcatttt caaaactgtt 60 

181 tttctttttt tttaataatg tttttcaata aactagactg ttgtgataat ttgctatgta 120 

182 cttgatacat ttttgtttaa tattcaggaa cacacagaaa aggggagaat catttggaat 180 

183 cagcagaata ggtagcaaaa ttaaaggagt attcaaaagt accacaatgg agggagctat 240 

184 gttgcctaat tatggtgtag ctgaaggtga agatgatttt attgaagaag gtattgttgt 300 

185 aatggaagat gattctccag tggaggctgt gagcacacct aatactcccc gaaaccttgc 360 

186 tgcatggaaa attagcattc catatgtaga cttttttgag gatccctcct ctgaaaggaa 420 

187 ggagaaaaaa gaaagaattc ctgtgttttg tattgatgtt gaaagaaatg atagaagagc 480 

188 agttggacac gagcctgaac attggtctgt ctatagaaga tatcttgaat tctatgtact 54 0 

189 tgaatcaaaa ctaacagaat ttcatggtgc atttcctgat gcccagcttc cttctaagag 600 

190 gatcattggc cccaaaaatt atgaattctt aaagtcaaag agggaagagt tccaagaata 6 60 

191 tctacagaaa cttctgcagc atccagaact gagtaatagt caacttctgg cagactttct 720 

192 ttcccctaat ggtggggaaa cacaatttct tgataagata ctaccagatg taaatcttgg 780 

193 gaaaattata aaatctgttc ctggaaaact aatgaaagag aaaggtcagc atttggaacc 840 

194 ttttatcatg aatttcatta attcttgtga gtctccaaag cctaaaccaa gtagaccaga 900 

195 actgaccatt ctcagcccta cttcagaaaa caacaagaag cttttcaatg atctgtttaa 960 

196 aaataatgca aaccgtgctg aaaatacaga gagaaagcaa aatcagaatt attttatgga 1020 

197 ggtgatgact gtagaaggag tctatgatta cctgatgtat gtaggacggg tagttttcca 1080 

198 ggttcctgac tggcttcatc atctcttaat gggaactcga atcctcttta aaaacaccct 1140 

199 ggaaatgtat actgattact atcttcagtg taaactagaa cagctatttc aggagcaccg 1200 

200 tttggtctca ctcataacac ttctcagaga tgctatattc tgtgaaaaca ctgaacctcg 12 60 

201 ctctctccaa gataagcaaa aaggagcaaa acagactttt gaagaaatga tgaattacat 1320 

202 tccagatctg ttagtcaagt gtattggtga agaaaccaag tatgaaagca tcagacttct 1380 

203 gtttgatggc ttacagcaac cagtactcaa caagcagctg acttatgttt tattggacat 1440 

204 tgtgatacag gaactgtttc cagagctcaa taaggtacaa aaggaagtta cctctgtgac 1500 

205 atcttggatg taaacacttg gatttggtat agaataaccc attgaaattt ctgctgtgcg 1560 

206 agggtggtag aaatttactt ttttgggtat attcttatat atattatgta catcgctgtc 1620 

207 tgaaatttta gttatttttt gtttttaata aagactaaca caaacttaat gattaaaagt 1680 

208 gattgagtct catagtcttt catttgctag ctgtgatcca aattttatta gaacataagt 1740 
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209 cacttgttat tgccattttt aaaagagaaa 

210 gactgataaa cttcgtattg tatagctttg 

211 gaataagatc tgattttctt agagttaata 

212 attttgtaca tagttaactg tgtatctata 

213 taaaaaaaaa aa 

215 <210> SEQ ID NO: 4 

216 <211> LENGTH: 2507 

217 <212> TYPE: DNA 

218 <213> ORGANISM: Homo sapiens 
-220 <220> FEATURE: 

221 <221> NAME/KEY: misc_f eature 

222 <223> OTHER INFORMATION: Incyte 
224 <400> SEQUENCE: 4 

22 5 cgcgccgaac tgcagccatg gagcaggcac 

226 cgcccttgga gccgctgggc tccccagacg 

227 cggagggggc cggagaagag agctctgggg 

228 tgaagaagat agaaatcagt gtttcagaag 

229 acatgcaaga aacatatact gcttacctca 

230 gtcagagtgt cctaacagac tcactatggc 

231 gctacctttt agtttactat ccacatattg 

232 aatttgtttg gcataaactc tctgctgaca 

233 ggattggttt agaaaacttt ctcttgagga 
2 34 aaatcttcta tctgttttta acacaggaag 

23 5 ggtttcagct gaaggcagac tccaggttaa 

236 acccagacaa gagatttact gaccttaagc 

237 cacatcttct tcgagtcaga gctagagtag 

238 atgggaatta tggtcgagtt ttcagtgaat 

239 gactgcagag tgctggtcat catatggatg 

240 aagatgaaga acattatgca gatcagttaa 

241 gggctgtgtg caggaaacat gaacttatgc 

242 tagcatccaa gaagcagcag tgtgaggaac 

24 3 tgaagggaat gactaccaag ctctttggtc 
244 taaaggtgct agaagaacaa ataaatgaag 
24 5 aaggcagaga atttgtgaaa aacgcatggg 

246 accgagactt aaaggaggcc ctcataagct 

247 agggaattca agtttggacc aatgctaagg 

248 tgaatttctc ttcaatcaaa gtgccccaaa 

249 agtcactacc tagtatacat aaacatatac 

250 cttaattgtg gtcgtgttaa tgtagcacaa 
2 51 taatttgagg ttttggggac tggtgctgat 
2 52 acagattgtt tgtcacgctt ctgaaccaat 

253 agatgtttgt ttcaagacca gctgtttcag 

254 attgaattcc tatatttact tggtaagatc 

255 ttgccttata gttgagctat ttggtttgac 
2 56 ctacgatttt cttctacatt cattgatgcc 
257 ccttcaattt gtgttttacc agaattttga 
2 58 agaagactca acttctcaga tctataatgg 

259 atagaaagtt ttaactgaat atgttacata 

260 aacctcagta atgagcctaa atttactctg 



attcataatg atgttatggc 
aaaataatta tgcctagtat 
tattttagta gattggtttt 
aataaagcat cctatatgag 



aaacagataa 1800 
ggagaaacag 1860 
cctttttttt 1920 
tttttaataa 1980 
1992 



ID No: 5215690CB1 



ctccggaccc 
ctgggctggg 
tcgacacgat 
cagaaaaacg 
ttgaaacaag 
ggcgatatag 
ttgtgccacc 
acatggatcc 
ttgcttcaca 
gtaactggaa 
aagcgcttaa 
actatagtga 
cagatcgact 
ggagtgccat 
tgtatgcatc 
aagagtatct 
agtatgactt 
tggtaactgg 
aagaaactcc 
gagaacaaca 
ctgatattga 
atgcagtcat 
aatgctttag 
acagaagcac 
aataagttaa 
aaaatatatt 
tccaaaaagt 
gactgaatgt 
atctattaat 
cactaatctg 
aaagctcagc 
ccttgtaatg 
tactggtcag 
gatacattgt 
taagaattaa 
cttggctctc 



cgagcggcag 
ggctgcggtc 
gacacacaat 
aactggaaga 
gtcagttgaa 
tgaatttgag 
tctgccagaa 
agattttgtg 
tcccatcctt 
ggagactgtg 
tgcaacattc 
tgaactgcag 
ctatggtgta 
agaaaaagaa 
ttctattgat 
tttttatgca 
ggagatggct 
gactgtgaga 
agagcagaga 
gctaaagtct 
acgcttcaaa 
gcagatcagt 
caagatgtaa 
aagtaaataa 
ataaattcag 
ttaatgaaga 
taatttaata 
caagatgttc 
gtagggaatt 
ttatagggag 
agaaacttga 
tttgcataca 
aaattttata 
catcctctag 
attcttctca 
tacacatggc 



ctccagccgg 
ggcaaggaag 
aatttttggt 
aatgccatga 
cataccgatg 
ttgttgagaa 
aaacgggcag 
gagaggcgac 
tgtagagaca 
aatgaaactg 
agagtgaaaa 
tctgtcatct 
tataaagtac 
atgggtgatg 
gatattttgg 
gaagcattgc 
gctcaggact 
acattctctt 
gaagccagaa 
aaaaatctgg 
gaacaaaaga 
atgtgcaaaa 
tcctgtgaat 
aagaaattta 
ctttctttaa 
ttaaatatta 
atatatacca 
gttaatttct 
tttccctaag 
ttgttttctc 
tgtgaaaaat 
ctagaaaatg 
ctgccaacaa 
caactcctat 
aataattctt 
atttcagggt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 
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VERIFICATION SUMMARY DATE: 10/24/2002 

PATENT APPLICATION: US/09/744 , 313A TIME: 16:11:47 

Input Set : A:\pf0706usn_seqlist.txt 
Output Set: N:\CRF4\10242002\I744313A.raw 

:10 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
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